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We present a stochastic agent-based model for the distribution of personal incomes in a developing 
economy. We start with the assumption that incomes are determined both by individual labour and 
by stochastic effects of trading and investment. The income from personal effort alone is distributed 
about a mean, while the income from trade, which may be positive or negative, is proportional to the 
trader's income. These assumptions lead to a Langevin model with multiplicative noise, from which 
we derive a Fokker-Planck (FP) equation for the income probability density function (IPDF) and 
its variation in time. We find that high earners have a power-law income distribution while the low 
income groups have a Levy IPDF. Comparing our analysis with the Indian survey data (obtained 
from the world bank website) taken over many years we obtain a near-perfect data collapse onto 
our model's equilibrium IPDF. The theory quantifies the economic notion of "given other things". 
Using survey data to relate the IPDF to actual food consumption we define a poverty index [l], @], 
which is consistent with traditional indices, but independent of an arbitrarily chosen "poverty line" 
and therefore less susceptible to manipulation. 



PACS numbers: 89.65.Gh,02.50.-r 

Poverty has been a feature of all human societies 
throughout time. The underlying cause is the unequal 
distribution of personal incomes which is an emergent 
feature of a free economy, invariably resulting in extreme 
wealth for a few and relative poverty for many. 

Since the work of Pareto Q , the distribution of incomes 
has been known to have a power law tail at the high end 
4 1 . There have been many models of the dynamic process 
3 , @, @, S, E, by which a power-law tail can develop 
for high incomes. Yet in an interlinked economy the low 
income distribution emerges from the same dynamics as 
the high income. 

Significantly less effort has been applied to study the 
distribution of low- incomes, but to study poverty this 
is the critical part of the IPDF. Empirical data shows 
that low-income distributions are not well described by 
a Pareto-style power law with a sharp cutoff, as is typi- 
cally introduced to obtain a normalizable IPDF. Rather 
than curve-fitting to data, we seek to model the most el- 
ementary processes of economic activity, and to find the 
distribution which emerges. 

Such interacting systems are well described using 
methods in statistical physics @, [l(], 17]. Our basic idea 
is to represent each individual as an "agent" , generating 
income through personal labour and trade. We describe 
the income above starvation level, yi{t), of each agent, 
i, with a stochastic dynamical equation which describes 
both labour and trade. It will turn out that trade is the 
crucial feature for high income groups, while labour is 



important for lower income groups. 

We postulate that the time variation of agent income 
has the form of a Langevin equation: 



Myt + r]i(t)yi 



(1) 



Where C represents the rate of increase in income possi- 
ble from labour, My represents the increasing difficulty 
to maintain a high income, and T)i(t), a random variable 
with zero mean, represents the stochastic effects of trad- 
ing. C{t) is a property of the economy as a whole and is 
slowly varying in time. Possible gains from employment 
depend on how the economy as a whole is performing. 
M is a constant which we shall later determine from em- 
pirical data. Note that a non-zero mean for the noise 
term would be equivalent to a smaller value of M, so no 
assumption is being made about net benefits of trade. 

It can be seen that income from labour alone is the 
same for each person, however the value of trading is 
proportional to an individual's current wealth. This mix 
introduces multiplicative noise which is in contrast with 
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previous dynamical approaches 
nomics, producing anomalous diffusion from the noise it- 
self, not fractional dynamics [3, HB| ■ Equation 1 does 
not map on to any well-known physical system, however, 
there is increasin g n eurological evidence for such non- 
linear risk taking HI, ljj ]. 

While it is easy to postulate reasonable-looking intu- 
itive theories for income distribution, there are no known 
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fundamental laws, and so empirical verification is essen- 
tial [2(J. The largest dataset available for personal in- 
come in a developing economy is that collected by the In- 
dian National Sample Survey Organisation (NSSO) cov- 
ering incomes of millions of people for almost 40 years 
Q ■ The same survey reveals the fraction of income spent 
on staple food (cereals). Since food is the absolute min- 
imum necessity for survival, we will base our measure of 
poverty on expenses related to its consumption. 

The raw NSSO data comprises income bands ("expen- 
diture classes") of irregular size, from which we generate 
cumulative income distribution functions (CDFs). Fig- 
ure 1 shows three typical graphs out of 21 surveys across 
more than a million households (household size varies 
from 4-6) between the years 1959-1991. Once scaled to 
match the mean income for each year, there is a remark- 
able data collapse. The inset shows that the IPDF emerg- 
ing from our model is also in excellent approximation to 
this functional form, as we now discuss. 

We assume that the trading decisions are made before 
their outcome is known, which indicates that we should 
use Ito calculus: had we assumed mid-term review of 
trading strategy it would imply Stratonovich calculus, 
which leads to an equivalent equation with rescaled M. 
This leads to a Fokker-Planck equation derived from the 
Langevin model (eq[l]). 



^(y, t) = I- {[(M + 2)y C(t)\ f + J£} (2) 
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In the steady state (C(t) = Co), this would give us the 
income distribution: 



f(y,t-> oo) 
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cxp(-C /y) 



r(M + l) y M + 2 



(3) 



Eigenvalue analysis shows that this solution is stable 
against perturbations. Equation [5] can be analytically 
solved using Laplace transforms to obtain the full time- 
dependent solution as a sum of confluent hypergeomctric 
functions F(a, 6, z) with time-dependent coefficients: 



f(y,i)= ^2 ex P(-u n t) g n (y) 



(4) 



n=0 



where oj n = 2im and 
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Renormalised deflated income 



FIG. 1: Plots of the cumulative distribution functions (CDFs) 
against deflated income for selected years, with inflation inde- 
pendently sourced from the consumer price index (CPI) and 
renormalized to the 1974 mean income in rupees (64.84 INR). 
The green line is our theoretical curve, taking j/< as income 
above a non-zero level below which agents would die of star- 
vation (set at 0.15 in renormalized units). Inset shows the 
IPDF which is the differential of the CDF, evaluated from 
the data by interpolation. The points are the data from the 
NSSO, the lines are our analytic function for the steady state 
distribution, the only fitting parameters are the power-law 
tail exponent M + 2 = 3.6 



and A\ and A2 are constants dependent on initial condi- 
tions. 

Any redistribution measure can be represented by a 
sum of these functions, and from the associated decay 
times the timescale of its effect can be determined. How- 
ever, the data collapse in figure 1 suggests that the relax- 
ation time back to the steady state distribution is rapid. 

There is one free parameter, M, which incorporates 
difficulty of maintaining high income, mean benefit of 
trade and any possible misconception from our choice of 
Ito calculus. M = 1.6 gives in extremely good agreement 
with the NSSO data, with M + 2 = 3.6 describing the 
exponent in the power law tail, Co j^m+i) being the mean 
income. This gives us great confidence that our simple, 
intuitive model does indeed describe the coarse features 
of the Indian economy, and allows us to proceed with our 
model of poverty. 

Whether poverty-reduction measures are regarded as 
successful or not often depends on the precise definition 
of poverty, a semantic which is still argued over. Sen has 
defined the so-called "axioms of poverty" [lj . 

• Given other things, a reduction in income of a per- 
son below the poverty line must increase poverty. 

• Given other things, a pure transfer of income from 
a person below the poverty line to anyone who is 
richer must increase poverty. 

While these seem to be self-evident, they arc based on 
the ill defined notion of "other things" being unchanged. 
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This is troublesome: it implies that we should be deal- 
ing with partial derivative, but does not specify exactly 
which variables should be held constant it is not possi- 
ble simply to increase mean income and "hold everything 
else constant" . Worse, a dynamic system will have some 
non-trivial response to any income reduction of transfer. 

By defining the process underlying the income distribu- 
tion, it becomes possible to define precisely what "given 
other things" means. For example, increasing mean in- 
come corresponds to increasing C in equation 1. A flat- 
rate tax: increasing low incomes, reducing high incomes, 
maintaining mean income, would correspond to a multi- 
plier on the C — My term, more complex tax arrange- 
ments would change its form altogether. The effect of 
such changes will induce both a transient response and a 
steady-state change in IPDF. 

In the context of India, there has been considerable 
debate on the 'true' measure of poverty as the so-called 
"poverty line" estimates remain controversial j2?| yet 
crucial in the empirical literature on poverty analysis. 
The official poverty lines which guide Planning Commis- 
sion policy are based on the nutritional need for calories, 
but these have been criticized for under-estimating 'true' 
poverty in India 21 1. 

Three conventional poverty measures involve defining 
a certain income as the "poverty line" , and counting 

(i) the fraction of the population with incomes below 
it (headcount index, HCI) 

(ii) The mean percentage below the poverty line 
(poverty gap index, PG) 

(iii) The mean percentage squared below the poverty 
line (squared poverty gap index, SPG) 

A difficulty with such measures is to define the 
"poverty line" , a somewhat arbitrary level of income 
which also changes with time due to inflation. The suc- 
cessive definitions of poverty measures above reduce the 
sensitivity of the poverty index to this choice, but do not 
eliminate it, and pathological cases can easily be derived, 
especially in practice where NSSO data is discretized into 
expenditure classes rather than a continuous. 

To define a more robust poverty measure, we apply 
the idea of consumption deprivation (CD) for a specific 
resource 0, 0, [2J]. This uses the fact that expendi- 
ture on cereals is monotonically increasing with income, 
but flattens above a certain income, reflecting the satura- 
tion of demand for cereal once one has sufficient to avoid 
malnutrition. 

Correlating the NSSO income data with that for cereal 
expenditure, we find a good fit to a Monod relationship 



CD(y)=VK/(K + y) 



(8) 



where the parameters V and K are time dependent [23j | . 
Broadly, K can be taken as a "poverty line" which ac- 
counts indirectly for cereal-price inflation as opposed to 
general inflation. V measures the deflated price of ce- 
reals. A more intuitive measure of deprivation is the 



quantity CD(y)/V, which is the fraction of the maxi- 
mum desirable cereal consumption actually consumed by 
someone of income y. 

The advantage of this measure is that it is based on 
people's actual choices, not on the price of an arbitrarily 
chosen "basket of goods". So, for example, increasing 
housing or clothing costs may affect CD even when ce- 
real prices are steady, as income has to be moved from 
one commodity to another to balance the overall bud- 
get. Similarly CD is not affected by changes in the CPI 
due to price shifts of luxury goods purchased only by the 
wealthy. 

Perhaps most importantly, cereal consumption is di- 
rectly measured by the NSSO. This allows us to assign a 
level of poverty to each such NSSO "expenditure classes" . 
By summing this measure, a poverty index based on ac- 
tual consumption deprivation may be evaluated. We refer 
to this as the CD-index of poverty, Pcd- 

Our model allows us to quantify this CD-index of 
poverty. The model definition of the CD-index satis- 
fies the standard axioms of a poverty index [l|, Q , elimi- 
nates the arbitrary "poverty line" , and makes explicit the 
meaning of "given other things" . Using the NSSO data, 
we can fit an analytic form [23j to the ratio of grain ex- 
penditure to income. The CD-index is then defined by 
the relation 



PcD(t) 



K(t) + y 



f(y,t)dy 



(9) 



where parameters V, K are obtained from NSSO data 
while f(y,t) is the solution of equation [2] The income 
data used to parameterize our model is independent of 
the consumption data used to measure CD directly. In 
Fig. [2j we compare the PcD(i) evaluated directly from 
the NSSO consumption data, and indirectly from our 
income-data based model. We also show the PG and SPG 
indices. All indices show poverty declining in time, with a 
peak due to sharp drops in income in the 1960s. However 
the CD-index shows the effect of increasing cereal prices 
between 1978-84 as causing an increase in poverty, an 
effect which cannot be captured in the standard indices. 



Against the CPI-deflated data, we see that mean in- 
comes have generally risen over the last forty years, while 
the relative price of cereals {V(t)) has dropped steadily 
(see Supplementary Material). This helps to reduce 
poverty, although more direct targetting [25[ may be even 
more effective. 

Returning to the stationary IPDF, the power law expo- 
nent M is seen to be a crucial component in quantifying 
the mean income: Cor(M)/r(M + 1). Critically, since 
we have shown that if trading is, on average, beneficial 
rather than neutral, it will reduce M. Small M increases 
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FIG. 2: Plots of the CD-index, a measure of poverty [23| . 
against time. The broken line shows the CD-index as per the 
official head-count poverty index while the continuous line 
shows the poverty index arising from our IPDF. To generate 
the latter we have taken the function C{i) in equation 1 to be 
a piecewise-linear interpolation of the NSSO-measured mean 
income at each round and assumed a relaxation time less than 
a year. Also shown are the poverty gap and squared poverty 
gap indices, with the poverty line set at the official rate 356 
Indian rupees per month. The absolute numbers cannot be 
compared, but the similarity in trends is evident. V(t) and 
K(t) are defined from the NSSO data: graphs of V(t), K(t) 
and C(t) are given in the Supplementary Material. 



both the mean income and the level of inequality - it 
transfers capital from lower to higher income groups. 

This illustrates a problem with Sen's axioms. Rais- 
ing mean incomes "given other things" reduces poverty, 
while transferring income to higher income groups " given 
other things" increases poverty. So, in this worldview, 
the effect of beneficial trade on poverty depends on the 
definition of "other things". Although one can devise 
pathological cases, what we find here is that the effect of 
increasing trade [M ~ 1) is to reduce absolute poverty 
provided the mean income is above the "poverty line" for 
a headcount index or K for Pcd ■ However, it also has the 
effect of increasing measures of "relative poverty" where 
the "poverty line" is a fixed fraction of the mean income. 

In summary, we have postulated a stochastic model for 
the evolution of the income distribution in a developing 
economy. The steady state of the distribution is stable 
and robust, and in excellent agreement with the massive 
NSSO data set for Indian incomes over many years. The 
existence of an underlying probability distribution func- 
tion parameterized by mean income makes it much easier 
to estimate poverty than existing measures such as the 
head-count index. Under this measure the poverty index 
is completely specified by the data, without recourse to 
defining a "poverty line" . Moreover, the measure is less 
susceptible to manipulation by distortions to the income 
distribution around the poverty line: "lifting people out 
of poverty" (just). A major strength of this theory lies 
in its potential power of predicting the response of the 



IPDF, and hence the poverty index, to external effects, 
up to a reasonably close (perturbative) time span. 
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